Overview

Dataset statistics

Number of variables: 9
Number of observations: 1108612
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 84.6 MiB
Average record size in memory: 80.0 B
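
The average record size follows from the total memory and the row count. A minimal sketch of that arithmetic, assuming 1 MiB = 1,048,576 bytes; the variable names are chosen here for illustration only:

```python
# Figures copied from the "Dataset statistics" table.
total_size_mib = 84.6
n_observations = 1_108_612

# Assumption: MiB is the binary unit, 1024 * 1024 bytes.
total_bytes = total_size_mib * 1024 ** 2
avg_record_bytes = total_bytes / n_observations

print(f"{avg_record_bytes:.1f} B per record")
```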

Variable types

DateTime: 1
Categorical: 7
Numeric: 1

Alerts

type_of_work has constant value "Full-time employees" (Constant)
uom is highly correlated overall with value and 1 other field, wages (High correlation)
value is highly correlated overall with uom and 1 other field, wages (High correlation)
wages is highly correlated overall with uom and 1 other field, value (High correlation)

Reproduction

Analysis started: 2024-10-09 17:26:33.783754
Analysis finished: 2024-10-09 17:26:51.422355
Duration: 17.64 seconds
Software version: ydata-profiling v4.10.0
Download configuration: config.json
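
The reported duration is the difference between the two timestamps. A small sketch using only the standard library reproduces it; the format string is an assumption that matches how the timestamps are printed here:

```python
from datetime import datetime

# Assumed format, matching the timestamps as printed in the report.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2024-10-09 17:26:33.783754", FMT)
finished = datetime.strptime("2024-10-09 17:26:51.422355", FMT)

duration_seconds = (finished - started).total_seconds()
print(f"{duration_seconds:.2f} seconds")
```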

Variables

ref_date
DateTime

Distinct: 324
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 16.9 MiB
Minimum: 1997-01-01 00:00:00
Maximum: 2023-12-01 00:00:00
Histogram with fixed size bins (bins=50)

geo
Categorical

Distinct: 11
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 16.9 MiB

Value | Count
Canada | 130827
Ontario | 123608
Quebec | 115706
British Columbia | 107581
Manitoba | 107071
Other values (6) | 523819

Length

Max length: 25
Median length: 16
Mean length: 10.97628
Min length: 6

Characters and Unicode

Total characters: 12168436
Distinct characters: 32
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
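
The report shows a single category, script and block for this variable, each labelled "(unknown)". To illustrate what a character property looks like, here is a minimal sketch using Python's standard `unicodedata` module, which exposes the general category of a code point but not its script or block. The helper name `category_counts` and the three sample strings are chosen for illustration; this is not how the report computed its figures.

```python
import unicodedata
from collections import Counter


def category_counts(values):
    """Count Unicode general categories over every character of the given strings."""
    counts = Counter()
    for text in values:
        for ch in text:
            # e.g. "Lu" = uppercase letter, "Ll" = lowercase letter, "Zs" = space separator
            counts[unicodedata.category(ch)] += 1
    return counts


# Three geo values taken from the report, used here only as sample input.
sample = ["Canada", "British Columbia", "Newfoundland and Labrador"]
counts = category_counts(sample)
print(counts)
```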

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Canada
2nd row: Canada
3rd row: Canada
4th row: Canada
5th row: Canada

Common Values

Value | Count | Frequency (%)
Canada | 130827 | 11.8%
Ontario | 123608 | 11.1%
Quebec | 115706 | 10.4%
British Columbia | 107581 | 9.7%
Manitoba | 107071 | 9.7%
Alberta | 104679 | 9.4%
Saskatchewan | 99583 | 9.0%
New Brunswick | 89997 | 8.1%
Nova Scotia | 88693 | 8.0%
Newfoundland and Labrador | 71089 | 6.4%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
canada | 130827 | 7.8%
ontario | 123608 | 7.4%
quebec | 115706 | 6.9%
british | 107581 | 6.4%
columbia | 107581 | 6.4%
manitoba | 107071 | 6.4%
alberta | 104679 | 6.2%
saskatchewan | 99583 | 5.9%
brunswick | 89997 | 5.4%
new | 89997 | 5.4%
Other values (8) | 599987 | 35.8%
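
The word counts above are consistent with lowercasing each category value, splitting it on whitespace, and weighting every token by the value's row count. A minimal sketch under that assumption, using only the ten geo values listed under Common Values; `word_counts` is a hypothetical helper, not the profiler's own code:

```python
from collections import Counter

# Row counts copied from the geo "Common Values" table (the ten listed rows only).
geo_counts = {
    "Canada": 130827,
    "Ontario": 123608,
    "Quebec": 115706,
    "British Columbia": 107581,
    "Manitoba": 107071,
    "Alberta": 104679,
    "Saskatchewan": 99583,
    "New Brunswick": 89997,
    "Nova Scotia": 88693,
    "Newfoundland and Labrador": 71089,
}


def word_counts(value_counts):
    """Weight each lowercase, whitespace-separated token by its value's row count."""
    words = Counter()
    for value, n in value_counts.items():
        for word in value.lower().split():
            words[word] += n
    return words


words = word_counts(geo_counts)
print(words.most_common(5))
```

Because the input holds only the listed values, the counts for the listed words match the table, but the frequency percentages cannot be reproduced from it.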

Most occurring characters

Value | Count | Frequency (%)
a | 1842538 | 15.1%
n | 903909 | 7.4%
i | 801890 | 6.6%
r | 707599 | 5.8%
e | 666538 | 5.5%
o | 657824 | 5.4%
t | 631215 | 5.2%
d | 624517 | 5.1%
(space) | 568005 | 4.7%
b | 506126 | 4.2%
Other values (22) | 4258275 | 35.0%

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 12168436 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 12168436 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 12168436 | 100.0%

wages
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 16.9 MiB

Value | Count
Average hourly wage rate | 587455
Total employees, all wages | 521157

Length

Max length: 26
Median length: 24
Mean length: 24.940197
Min length: 24

Characters and Unicode

Total characters: 27649002
Distinct characters: 19
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Total employees, all wages
2nd row: Total employees, all wages
3rd row: Total employees, all wages
4th row: Total employees, all wages
5th row: Total employees, all wages

Common Values

Value | Count | Frequency (%)
Average hourly wage rate | 587455 | 53.0%
Total employees, all wages | 521157 | 47.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value | Count | Frequency (%)
average | 587455 | 13.2%
hourly | 587455 | 13.2%
wage | 587455 | 13.2%
rate | 587455 | 13.2%
total | 521157 | 11.8%
employees | 521157 | 11.8%
all | 521157 | 11.8%
wages | 521157 | 11.8%

Most occurring characters

Value | Count | Frequency (%)
e | 4434448 | 16.0%
a | 3325836 | 12.0%
(space) | 3325836 | 12.0%
l | 2672083 | 9.7%
r | 1762365 | 6.4%
g | 1696067 | 6.1%
o | 1629769 | 5.9%
w | 1108612 | 4.0%
t | 1108612 | 4.0%
y | 1108612 | 4.0%
Other values (9) | 5476762 | 19.8%

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 27649002 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 27649002 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 27649002 | 100.0%

type_of_work
Categorical

CONSTANT 

Distinct: 1
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 16.9 MiB

Value | Count
Full-time employees | 1108612

Length

Max length: 19
Median length: 19
Mean length: 19
Min length: 19

Characters and Unicode

Total characters: 21063628
Distinct characters: 13
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Full-time employees
2nd row: Full-time employees
3rd row: Full-time employees
4th row: Full-time employees
5th row: Full-time employees

Common Values

Value | Count | Frequency (%)
Full-time employees | 1108612 | 100.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value | Count | Frequency (%)
full-time | 1108612 | 50.0%
employees | 1108612 | 50.0%

Most occurring characters

Value | Count | Frequency (%)
e | 4434448 | 21.1%
l | 3325836 | 15.8%
m | 2217224 | 10.5%
F | 1108612 | 5.3%
u | 1108612 | 5.3%
- | 1108612 | 5.3%
t | 1108612 | 5.3%
i | 1108612 | 5.3%
(space) | 1108612 | 5.3%
p | 1108612 | 5.3%
Other values (3) | 3325836 | 15.8%

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 21063628 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 21063628 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 21063628 | 100.0%

national_occupational_classification_(noc)
Categorical

Distinct: 45
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 16.9 MiB

Value | Count
Sales and service support occupations [65] | 39205
Sales and service representatives and other customer and personal services occupations [64] | 38082
Sales and service occupations, except management [62-65] | 36985
Business, finance and administration occupations, except management [11-14] | 36462
Occupations in education, law and social, community and government services, except management [41-45] | 34876
Other values (40) | 923002

Length

Max length: 140
Median length: 80
Mean length: 72.011207
Min length: 19

Characters and Unicode

Total characters: 79832488
Distinct characters: 55
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90]
2nd row: Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90]
3rd row: Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90]
4th row: Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90]
5th row: Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90]

Common Values

Value | Count | Frequency (%)
Sales and service support occupations [65] | 39205 | 3.5%
Sales and service representatives and other customer and personal services occupations [64] | 38082 | 3.4%
Sales and service occupations, except management [62-65] | 36985 | 3.3%
Business, finance and administration occupations, except management [11-14] | 36462 | 3.3%
Occupations in education, law and social, community and government services, except management [41-45] | 34876 | 3.1%
Administrative and financial support and supply chain logistics occupations [14] | 34525 | 3.1%
Occupations in manufacturing and utilities, except management [92-95] | 32653 | 2.9%
Trades, transport and equipment operators and related occupations, except management [72-75] | 32236 | 2.9%
Occupations in sales and services [63] | 31611 | 2.9%
Retail sales and service supervisors and specialized occupations in sales and services [62] | 31475 | 2.8%
Other values (35) | 760502 | 68.6%

Length

Histogram of lengths of the category

Words

Value | Count | Frequency (%)
and | 1473945 | 14.4%
occupations | 957507 | 9.4%
in | 537871 | 5.3%
management | 388040 | 3.8%
except | 286286 | 2.8%
services | 264513 | 2.6%
sales | 208833 | 2.0%
related | 187534 | 1.8%
natural | 150761 | 1.5%
service | 145747 | 1.4%
Other values (132) | 5609434 | 54.9%

Most occurring characters

Value | Count | Frequency (%)
(space) | 9101859 | 11.4%
a | 6545669 | 8.2%
e | 6210650 | 7.8%
n | 6175647 | 7.7%
i | 5117895 | 6.4%
s | 5114190 | 6.4%
t | 4778595 | 6.0%
c | 4392545 | 5.5%
o | 4377941 | 5.5%
r | 3921978 | 4.9%
Other values (45) | 24095519 | 30.2%

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 79832488 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 79832488 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 79832488 | 100.0%

sex
Categorical

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 16.9 MiB

Value | Count
Males | 581315
Females | 527297

Length

Max length: 7
Median length: 5
Mean length: 5.9512742
Min length: 5

Characters and Unicode

Total characters: 6597654
Distinct characters: 7
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Males
2nd row: Males
3rd row: Males
4th row: Males
5th row: Males

Common Values

Value | Count | Frequency (%)
Males | 581315 | 52.4%
Females | 527297 | 47.6%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value | Count | Frequency (%)
males | 581315 | 52.4%
females | 527297 | 47.6%

Most occurring characters

Value | Count | Frequency (%)
e | 1635909 | 24.8%
a | 1108612 | 16.8%
l | 1108612 | 16.8%
s | 1108612 | 16.8%
M | 581315 | 8.8%
F | 527297 | 8.0%
m | 527297 | 8.0%

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 6597654 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 6597654 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 6597654 | 100.0%

age_group
Categorical

Distinct: 3
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 16.9 MiB

Value | Count
25 to 54 years | 498529
55 years and over | 339489
15 to 24 years | 270594

Length

Max length: 17
Median length: 14
Mean length: 14.918687
Min length: 14

Characters and Unicode

Total characters: 16539035
Distinct characters: 15
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 15 to 24 years
2nd row: 15 to 24 years
3rd row: 15 to 24 years
4th row: 15 to 24 years
5th row: 15 to 24 years

Common Values

Value | Count | Frequency (%)
25 to 54 years | 498529 | 45.0%
55 years and over | 339489 | 30.6%
15 to 24 years | 270594 | 24.4%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value | Count | Frequency (%)
years | 1108612 | 25.0%
to | 769123 | 17.3%
25 | 498529 | 11.2%
54 | 498529 | 11.2%
55 | 339489 | 7.7%
and | 339489 | 7.7%
over | 339489 | 7.7%
15 | 270594 | 6.1%
24 | 270594 | 6.1%

Most occurring characters

Value | Count | Frequency (%)
(space) | 3325836 | 20.1%
5 | 1946630 | 11.8%
e | 1448101 | 8.8%
a | 1448101 | 8.8%
r | 1448101 | 8.8%
o | 1108612 | 6.7%
y | 1108612 | 6.7%
s | 1108612 | 6.7%
2 | 769123 | 4.7%
t | 769123 | 4.7%
Other values (5) | 2058184 | 12.4%

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 16539035 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 16539035 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 16539035 | 100.0%

uom
Categorical

HIGH CORRELATION 

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 16.9 MiB

Value | Count
Current dollars | 587455
Persons | 521157

Length

Max length: 15
Median length: 15
Mean length: 11.239211
Min length: 7

Characters and Unicode

Total characters: 12459924
Distinct characters: 13
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: Persons
2nd row: Persons
3rd row: Persons
4th row: Persons
5th row: Persons

Common Values

Value | Count | Frequency (%)
Current dollars | 587455 | 53.0%
Persons | 521157 | 47.0%

Length

Histogram of lengths of the category

Common Values (Plot)

Words

Value | Count | Frequency (%)
current | 587455 | 34.6%
dollars | 587455 | 34.6%
persons | 521157 | 30.7%

Most occurring characters

Value | Count | Frequency (%)
r | 2283522 | 18.3%
s | 1629769 | 13.1%
l | 1174910 | 9.4%
e | 1108612 | 8.9%
n | 1108612 | 8.9%
o | 1108612 | 8.9%
C | 587455 | 4.7%
u | 587455 | 4.7%
t | 587455 | 4.7%
(space) | 587455 | 4.7%
Other values (3) | 1696067 | 13.6%

Most occurring categories

Value | Count | Frequency (%)
(unknown) | 12459924 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
(unknown) | 12459924 | 100.0%

Most occurring blocks

Value | Count | Frequency (%)
(unknown) | 12459924 | 100.0%

value
Real number (ℝ)

HIGH CORRELATION 

Distinct: 5247
Distinct (%): 0.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 16.286047
Minimum: 0.2
Maximum: 57.2
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 16.9 MiB

Quantile statistics

Minimum: 0.2
5-th percentile: 0.9
Q1: 5
Median: 15.06
Q3: 23.8
95-th percentile: 39.98
Maximum: 57.2
Range: 57
Interquartile range (IQR): 18.8

Descriptive statistics

Standard deviation: 12.409786
Coefficient of variation (CV): 0.76198885
Kurtosis: 0.047619406
Mean: 16.286047
Median Absolute Deviation (MAD): 9.46
Skewness: 0.72827346
Sum: 18054907
Variance: 154.00279
Monotonicity: Not monotonic
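
Several of the figures above are tied together by simple identities: the range is maximum minus minimum, the IQR is Q3 minus Q1, the CV is the standard deviation divided by the mean, the variance is the squared standard deviation, and the mean is the sum divided by the row count. A quick sketch that recomputes them from the rounded values printed in the report, so small rounding differences are expected:

```python
# Rounded figures copied from the value variable's statistics.
minimum, maximum = 0.2, 57.2
q1, q3 = 5.0, 23.8
mean, std = 16.286047, 12.409786
total, n = 18054907, 1108612

value_range = maximum - minimum   # reported as 57
iqr = q3 - q1                     # reported as 18.8
cv = std / mean                   # reported as 0.76198885
variance = std ** 2               # reported as 154.00279
mean_from_sum = total / n         # reported as 16.286047

for name, result in [
    ("range", value_range),
    ("IQR", iqr),
    ("CV", cv),
    ("variance", variance),
    ("mean", mean_from_sum),
]:
    print(f"{name}: {result:.5f}")
```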
Histogram with fixed size bins (bins=50)
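
Fixed-size binning divides the span between the minimum and the maximum into equally wide intervals and counts the values falling in each. The sketch below is a standard-library illustration of that idea, not the plotting code behind the report; `fixed_width_histogram` is a hypothetical helper and the input is the ten values from the first sample rows.

```python
def fixed_width_histogram(values, bins=50):
    """Count values in `bins` equally wide intervals spanning [min, max]."""
    lo, hi = min(values), max(values)
    width = (hi - lo) / bins
    counts = [0] * bins
    for v in values:
        if width == 0:
            idx = 0  # all values identical: everything lands in the first bin
        else:
            # The maximum would index one past the end, so clamp it into the last bin.
            idx = min(int((v - lo) / width), bins - 1)
        counts[idx] += 1
    edges = [lo + i * width for i in range(bins + 1)]
    return counts, edges


# The ten values shown in the first sample rows of the report.
sample_values = [26.4, 22.0, 18.6, 19.9, 22.4, 23.4, 24.0, 25.5, 21.8, 17.4]
counts, edges = fixed_width_histogram(sample_values, bins=50)
print(len(counts), sum(counts))
```

Applied to the full value column, whose range is 57, the same rule would give bins 1.14 wide.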

Common Values

Value | Count | Frequency (%)
0.7 | 11745 | 1.1%
0.8 | 11213 | 1.0%
0.6 | 10601 | 1.0%
0.9 | 10237 | 0.9%
1 | 9442 | 0.9%
1.1 | 8544 | 0.8%
0.5 | 8286 | 0.7%
1.5 | 7950 | 0.7%
1.2 | 7733 | 0.7%
1.6 | 7672 | 0.7%
Other values (5237) | 1015189 | 91.6%

Minimum 10 values

Value | Count | Frequency (%)
0.2 | 3621 | 0.3%
0.3 | 5081 | 0.5%
0.4 | 3998 | 0.4%
0.5 | 8286 | 0.7%
0.6 | 10601 | 1.0%
0.7 | 11745 | 1.1%
0.8 | 11213 | 1.0%
0.9 | 10237 | 0.9%
1 | 9442 | 0.9%
1.1 | 8544 | 0.8%

Maximum 10 values

Value | Count | Frequency (%)
57.2 | 91 | < 0.1%
57.19 | 3 | < 0.1%
57.18 | 5 | < 0.1%
57.17 | 7 | < 0.1%
57.16 | 3 | < 0.1%
57.15 | 7 | < 0.1%
57.14 | 7 | < 0.1%
57.13 | 5 | < 0.1%
57.12 | 8 | < 0.1%
57.11 | 6 | < 0.1%

Interactions

Correlations

 | age_group | geo | national_occupational_classification_(noc) | sex | uom | value | wages
age_group | 1.000 | 0.128 | 0.187 | 0.024 | 0.036 | 0.208 | 0.036
geo | 0.128 | 1.000 | 0.080 | 0.036 | 0.075 | 0.104 | 0.075
national_occupational_classification_(noc) | 0.187 | 0.080 | 1.000 | 0.241 | 0.072 | 0.155 | 0.072
sex | 0.024 | 0.036 | 0.241 | 1.000 | 0.000 | 0.105 | 0.000
uom | 0.036 | 0.075 | 0.072 | 0.000 | 1.000 | 0.705 | 1.000
value | 0.208 | 0.104 | 0.155 | 0.105 | 0.705 | 1.000 | 0.705
wages | 0.036 | 0.075 | 0.072 | 0.000 | 1.000 | 0.705 | 1.000
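
The report does not name the association measure behind this matrix. As an illustration of why uom and wages score 1.000, the sketch below computes a plain (uncorrected) Cramér's V, a common measure for two categorical variables, on a 2x2 contingency table. The choice of measure is an assumption, and so are the zero off-diagonal cells, which are inferred from the two variables having identical counts (587455 and 521157) rather than read from the report. `cramers_v` is a hypothetical helper.

```python
import math


def cramers_v(table):
    """Plain Cramér's V for a contingency table given as a list of rows."""
    n = sum(sum(row) for row in table)
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    chi2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count under independence; assumes no empty row or column.
            expected = row_totals[i] * col_totals[j] / n
            chi2 += (observed - expected) ** 2 / expected
    k = min(len(row_totals), len(col_totals)) - 1
    return math.sqrt(chi2 / (n * k))


# Rows: uom (Current dollars, Persons); columns: wages
# (Average hourly wage rate, Total employees, all wages).
# Assumption: every "Current dollars" row is a wage-rate row and vice versa.
uom_vs_wages = [[587455, 0], [0, 521157]]
v_perfect = cramers_v(uom_vs_wages)

# A table with no association at all, for contrast.
v_none = cramers_v([[10, 10], [10, 10]])

print(round(v_perfect, 3), round(v_none, 3))
```

A value of 1 means each level of one variable determines the level of the other, which is why either uom or wages could be dropped without losing information.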

Missing values

A simple visualization of nullity by column.
The nullity matrix is a data-dense display that lets you quickly spot patterns in data completion.

Sample

First rows

index | ref_date | geo | wages | type_of_work | national_occupational_classification_(noc) | sex | age_group | uom | value
1944 | 1997-01 | Canada | Total employees, all wages | Full-time employees | Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90] | Males | 15 to 24 years | Persons | 26.4
1945 | 1997-02 | Canada | Total employees, all wages | Full-time employees | Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90] | Males | 15 to 24 years | Persons | 22.0
1946 | 1997-03 | Canada | Total employees, all wages | Full-time employees | Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90] | Males | 15 to 24 years | Persons | 18.6
1947 | 1997-04 | Canada | Total employees, all wages | Full-time employees | Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90] | Males | 15 to 24 years | Persons | 19.9
1948 | 1997-05 | Canada | Total employees, all wages | Full-time employees | Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90] | Males | 15 to 24 years | Persons | 22.4
1949 | 1997-06 | Canada | Total employees, all wages | Full-time employees | Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90] | Males | 15 to 24 years | Persons | 23.4
1950 | 1997-07 | Canada | Total employees, all wages | Full-time employees | Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90] | Males | 15 to 24 years | Persons | 24.0
1951 | 1997-08 | Canada | Total employees, all wages | Full-time employees | Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90] | Males | 15 to 24 years | Persons | 25.5
1952 | 1997-09 | Canada | Total employees, all wages | Full-time employees | Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90] | Males | 15 to 24 years | Persons | 21.8
1953 | 1997-10 | Canada | Total employees, all wages | Full-time employees | Management occupations [00, 10, 20, 30, 40, 50, 60, 70, 80, 90] | Males | 15 to 24 years | Persons | 17.4

Last rows

index | ref_date | geo | wages | type_of_work | national_occupational_classification_(noc) | sex | age_group | uom | value
1924479 | 2017-04 | British Columbia | Average hourly wage rate | Full-time employees | Labourers in processing, manufacturing and utilities [95] | Females | 55 years and over | Current dollars | 17.83
1924483 | 2017-08 | British Columbia | Average hourly wage rate | Full-time employees | Labourers in processing, manufacturing and utilities [95] | Females | 55 years and over | Current dollars | 16.72
1924497 | 2018-10 | British Columbia | Average hourly wage rate | Full-time employees | Labourers in processing, manufacturing and utilities [95] | Females | 55 years and over | Current dollars | 15.77
1924550 | 2023-03 | British Columbia | Average hourly wage rate | Full-time employees | Labourers in processing, manufacturing and utilities [95] | Females | 55 years and over | Current dollars | 23.57
1924552 | 2023-05 | British Columbia | Average hourly wage rate | Full-time employees | Labourers in processing, manufacturing and utilities [95] | Females | 55 years and over | Current dollars | 18.82
1924554 | 2023-07 | British Columbia | Average hourly wage rate | Full-time employees | Labourers in processing, manufacturing and utilities [95] | Females | 55 years and over | Current dollars | 20.70
1924555 | 2023-08 | British Columbia | Average hourly wage rate | Full-time employees | Labourers in processing, manufacturing and utilities [95] | Females | 55 years and over | Current dollars | 19.95
1924556 | 2023-09 | British Columbia | Average hourly wage rate | Full-time employees | Labourers in processing, manufacturing and utilities [95] | Females | 55 years and over | Current dollars | 20.21
1924557 | 2023-10 | British Columbia | Average hourly wage rate | Full-time employees | Labourers in processing, manufacturing and utilities [95] | Females | 55 years and over | Current dollars | 20.11
1924558 | 2023-11 | British Columbia | Average hourly wage rate | Full-time employees | Labourers in processing, manufacturing and utilities [95] | Females | 55 years and over | Current dollars | 21.02